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METHODS FOR THE PRODUCTION OF CELLS AND MAMMALS 



Baclcgrouud 

In general, the invention features novel methods for rapidly generating 
cell lines and mammals with site-specific genetic modifications of interest. 



of the research focus has shifted from physical mapping of the genome to 
fimctional annotation of its contents. The challenge Ues in evolving 
comprehensive approaches to efficiently define gene fimctions in vivo. The 
murine system is ideal for fimctional genomics because of the underlying 
15 biological similarity between himian and mouse, the rapid advances in mouse 
genome sequencing, and the ability to genetically manipulate mouse embryonic 
stem (ES) cells. 

Although gene disruption in mice has been widely used for functional 
analyses of genes in vivo, traditional procedures for generating site-specific 

20 gene knockout mice are generally time-consuming and labor-intensive. For 
these traditiorial methods, tEe targeting"consftct lisual^^^ contains 6- 1 0 kb of 
genomic sequence fi'om tiie gene of interest with, e.g., a neomycin-resistant 
(neo") gene inserted into the coding region and a herpes-virus thymidine kinase 
(tk) gene placed at oiie end (Fig. 1 A). The targeting construct is electroporated 

25 into ES cells and replaces the endogenous locus by homologous recombination; 
After verification by genomic DNA Blotting or by polymerase chain reaction 
(PGR) using primers that hybridize to the flanldng region and the dismpted 
gene, ES cells hearing the mutant locus are injected into a mouse blastocyst to 
generate chimera naice. By either method, the flanking sequences of the replica 

30 cannot be too long. Finally, heterozygous and eventually homozygous mutant 
mice are obtained firom the breeding of chimeric animals. 
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As the hximan genome sequencing effort approaches completion, much 
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To make the targeting-constmctoflen-requires fine restriction-ei^^ 
mapping of the gene and multi-step cloning, which is a long and tedions 
process. In order to prevent expression of a partial gene, which can result from 
alternative splicing, the insertion site should be as close to the translation 
5 initiation ATG as possible. This is often hindered by the lack of a convenient 
cloning site around tiie desired region. Furthermore, the length of homologous 
fragments that can be included in the targeting construct is often limited, 
resulting in low homologous recombination frequency. Data from haplotype 
mapping of human populations and studies of meiotio and mitotic 

10 recombination frequency in lower eukaryotes support the idea that favored sites 
for initiation of DNA exchange can be separated by up to several tens of 
kilobases from one another in.euchromatin. If these findings are relevant to- 
mitotic recombination in ES cells, it may be necessary to use veiy long 
flanking sequences to obtain high frequency homologous recombination. 

15 Creating such long flanking arms by conventional cloning is cumbersome and 
frequently impractical. 

Thus, improved methods are needed to more efficiently generate 
genetically modified cell lines and non-human mammals. 

20 Summary of the Invention 

The purpose of the present invention is to provide improved methods for 
genetically altering cells. In particular, these methods involve generating a 
modified artificial chromosome by hcmolpgous^re^ between an . 

artificial chromosome contaimng a nucleic acid of interest (e.g., a bacterial or 

25 yeast artificial chromosome containing a genomic insert from the same genus 
or species as the cell to be subsequently modified) and a nucleic acid (e.g., a 
linear DNA molecule) containing a region or the entire nucleic acid of interest 
with a desired mutation. In some embodiments, the artificial chromosome is a 
bacterial artificial chromosome (BAG) or a PI -based artificial chromosome 

30 (PAC), and the homologous recombination occurs in bacteria, or the artificial 
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chromosome is a yeast artificial chromosome (YAC), and the homologous 
recombination occurs in yeast The resulting modified artificial chromosome 
with the desired mutation is introduced into one or more cells (e.g., mammalian 
cells such as mouse, ungulate, or human cells), and the genetically modified 
5 cells in which homologous recombination occurs between the artificial 

chromosome and corresponding region of an endogenous chromosome in the 
cell are selected. 

In some embodiments, the cell is modified to express a reporter gene 
under the control of an endogenous promoter of interest. In other 

10 embodiments, the cell is modified such that an exogenous nucleic acid 

encoding a detectable protein is operably linked to an endogenous nucleic acid 
encoding a protein of interest, thereby generating a genetically modified cell 
that expresses a fusion protein having the detectable protein and the protein of 
interest or a fragment thereof These modified cells can be used in a variety of 

15 screening assays to identify candidate compounds that modulate the activity of 
the promoter of interest or the expression of the protein of interest. 
Additionally, these cells can be used in any standard method for the generation 
of marranals with one or more desired genetic modifications. These mammals 
are also usefiil in screening assays to identify compounds for the treatment or 

20 prevention of disease. 

Accordingly, in a furst aspect the invention provides a method of 
producing a genetically modified cell. This method involves inserting into one 
or more cells an artificial chromosome (e.g., a Imear BAG, P AC, or YAC) 
having a cassette wliich includes a first region of homology having substantial 

25 sequence identity to a first region of an endogenous chromosome of the cell(s), 
a selectable marker, and a second region of homology having substantial 
sequence identity to a second region of ttie endogenous chromosome. 
Homologous recombination occurs between the artificial chromosome and the 
endogenous chromosome (e.g., homologous recombination between tfie first 

30 region of homology and the first region of the endogenous chromosome and 
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homologous recombination between the second region of homology and the 
second region of the endogenous chromosome), and the cassette is integrated 
into the endogenous chromosome of one or more cells. One or more cells are 
selected in which the homologous recombination occurs, thereby selecting one 
5 or more genetically modified cells. 

In preferred embodiments, the artificial chromosome having the cassette 
is produced by a culturing a cell (e.g. a bacterial or yeast cell) that has (i) a 
linear DNA molecule-having the-cassette and-(ii) an-ardficial chromosome 
having nucleic acid that is substantial identical to the first and second regions 

10 of homology (e.g., a BAG, PAC, or YAC having a genomic insert firom the 
same genus or species as the cells to be subsequently modified) under 
conditions that result in homologous recombination between the linear DNA 
molecule and the artificial chromosome. In particular embodiments for 
generating the artificial chromosome with the cassette, the linear DNA 

15 molecule is introduced into the cell by transformation. In other embodiments, 
the linear DNA molecule is introduced into the cell by insertion of a circular 
vector having the sequence of the linear DNA molecule into the cell and 
cleavage of the vector to generate the linear DNA molecule inside the cell. It is 
also contemplated that the homologous-recombination between the artificial 

20 chromosome and the linear DNA molecvile can occur in an in vitro sample that 
has a recombinase that catalyzes homologous recombination. In preferred 
embodiments, the first and second regions of homology in the linear DNA 
molecule are less than 2,000, 1 ,000, 500, 250, 200, 100, 75, 50, or 25 
nucleotides in length. In other embodiments, the first and second regions of 

25 homology are between 2,000 and 10,000 nucleotides in length, such as between 
2,000 and 5,000 nucleotides or between 5,001 and 10,000 nucleotides in length, 
inclusive. Preferably, the percent sequence identity between the first region of 
homology in the linear DNA molecule and the corresponding region in the 
artificial chromosome and the percent sequence identity between the second 

30 region ofhomology in the linear DNA molecule and.the corresponding region 

4 
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in the artificial chromosome is at leasr90, 92, 94, 96, 98, or 100%. In some 
embodiments the linear DNA molecule has another region 5' to the first region 
of homology and/or another region 3' to the second region of homology. 

In other embodiments, the first and second regions of the endogenous 
5 chromosome are contiguous. In particular embodiments, the first and second = 
regions of the endogenous chromosome are part of the same exon or the same 
promoter. In other embodiments, the first and second regions of the 
endogenous chromosome are not contiguous (e.g., the first and second regions 
of the endogenous chromosome are part of different exons). 

10 In some embodiments, the selectable marker is inserted into an 

endogenous promoter or coding sequence and reduces the expression or 
activity of the protein-^expressed by the nucleic acid: In other embodim^ents, the 
cassette (e.g., the first or second region of homology) has a mutation of interest 
that is incorporated into the endogenous chromosome. Preferably, the 

15 integration of the cassette into the genome of the cell reduces the activity of an 
RNA (e.g., tRNA) or protein encoded by a nucleic acid of interest (e.g., a 
nucleic acid that includes a region located between or located within the first 
and second region of the endogenous chromosome that is mutated by 
integration of the cassette). In preferred embodiments, the amount of 

20 functional protein encoded by the nucleic acid of interest decreases by at least 
5,10, 25, 50, 75, 90, 95, or 100%. 

In various embodiments, the cassette includes a promoter and/or a 
polyadenylation signal sequence operably Hnked to the positive selection 
maker. In other embodiments, the cassette does not contain a promoter and/or 

25 a polyadenylation signal sequence, and the cassette integrates into the genome 
of the cell such that the positive selection naaker is operably linked to an 
endogenous promoter and/or polyadenylation signal sequence for expression of 
the selection marker in the cell. In some embodiments, tlie cassette has two 
positive selection markers; one selection marker for selection of bacteria or 

30 yeast cells witii the artificial chromosome having the cassette, and another 
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selection marker for subsequent selection of cells in which a region of the 
cassette from the artificial chromosome has inserted into the genome. In other 
embodiments, the cassette has one positive selection marker (e.g., a neomycin 
resistance gene) openibly linked to two different promoters, one prompter for 
5 e^qjression of the selection marker in bacteria (e.g., pE7) and another promoter 
for expression of the selection marker in mouse cells such as embryonic stem 
cells (e.g., PGK). 

In otiier preferred embodiments, the cassette includes a reporter gene, 
and the cassette is integrated into the genome of the cell such liiat the reporter 

10 gene is operably linked to an endogenous promoter of interest, thereby 

generating a genetically modified cell that expresses the reporter gene under 
the control of the promoter of interest. Exemplary reporter genes include 
nucleic acids encoding cMorampheiiicoI acetyltiansferase, firefly luciferase, 
renilla luciferase, p-galactosidase, secreted alkaline phosphatase, human 

15 growth hormone, P=glucuroaidase, green.fluorescent protein,_or.red.fluoresce . 
protein. In some embodiments, the method further includes administering one 
or more compounds to the genetically modified cells and selecting a compound 
that alters the expression of the reporter gene. This selected compound is 
usefiil for modulating the activity of the promoter of interest and thus 

20 modulating the expression of endogenous nucleic acids operably linked to the 
promoter. 

In still other preferred embodiments, the cassette includes a nucleic acid 
encoding a detectable protein (e.g., a fluorescent protein or a protein that 
catalyzes a reaction in which the reactant or product is detectable), and the 

25 cassette is integrated into the genome of the cell such that the nucleic acid is 
operably linked to an endogenous nucleic apid encoding a protein of interest, 
thereby generating a genetically modified cell that expresses a fusion protein 
having tiie detectable protein and tlie protein of interest or a fragment thereof- 
In some embodiments, the nucleic acid encoding the detectable protein replaces 

30 a portion of the nucleic acid encoding a protein of interest, thereby generating a 
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fusion nucleic acid that encodes a fusion protein that include^ the detectable ^ 
protein and a fragment of the protein of mterest. In other embodunents, the 
nucleic acid encoding the detectable protein is inserted upstream or 
downstream of the nucleic acid encoding a protein of interest, thereby 
5 generating a fusion nucleic acid that encodes a fusion protein that includes the 
detectable protein and the entire protein of interest. In some embodiments^ the 
method further includes administering one or more compounds to the 
genetically modified cells and selecting a compound that alters the expression 
of the fusion protein. This selected compound is useful for modulating the 

10 expression of the protein of interest 

In desimble embodiments, the steps of the method are repeated, thereby 
generating a genetically modified cell with two or more mutations. The 
mutations may be in differeiit alleles of the same gene or in different genes. In 
some embodiments, each cassette has a recombinase signal sequence, tliereby 

15 generating a genetically modified cell with two recombinase signal sequences. 
Recombination may occur between the recombinase signal sequences in the 
cell. 

In some embodiments, the recombinase signal sequences are in the same 
endogenous chroinosome of "the cell, and recombination between the 

20 recombinase signal sequences results in elimination of the DNA between the 
recombinase signal sequences. In other embodiments, the recombinase signal 
sequences are in different endogenous chromosomes of the cell, and 
recombination between the recombinase signal sequences results in 
chromosomal translocation between the recombinase signal sequences. 

25 In other preferred embodiments of various aspects of the invention, the 

cell is an adult, fetal, or embryonic cell. Examples of preferred cells include 
undifferentiated cells such as embryonic cells (e.g., embryonic stem cells or 
embryonic germ cells) and differentiated or somatic cells such as epithelial 
cells, neural cells, epidermal cells, keratinocytes, hematopoietic cells, 

30 melanocytes, chondrdcytes,'B-lympHocytes, T-lymphdcytes, erythrocytes. 



wo 2004/013299 



PCT/US2003/024322 



macrophages, monocytes, fibroblasts, and muscle cells. In another preferred 
embodiment, the cell is fi:om the female reproductive system, such as a 

mammary gland, ovarian cumulus, granulosa, or oviductal cell. Other 

•■*. 

preferred cells include germ and placental cells. Preferred cells also include 
5 those from any organ, such as the bladder, brain, esophagus, fallopian tube, 
heart, intestines, gallbladder, kidney, liver, lung, ovaries, pancreas, prostate, 
spinal cord, spleen, stomach, testes, thymus, thyroid, trachea, ureter, urethra, 
and uterus. 

The genetically modified cells produced by any of the methods of the 

10 invention can be used as donor cells for the production of non-human mammals 
with one or-more-desired.genetic.modifications. Jn.particular,.fhese can 
be used in any standard method for the generation of chimeric or cloned 
mammals. In some embodiments, a cell is inserted into an embryo (e.g., a 
blastocyst) or fetus for the generation of a chimeric mammal that contains some 

15 cells with the genetic mutation(s) and some cells without the mutation(s). In 
other embodiments, a cell or a nucleus fiom the cell is inserted into an oocyte 
for the generation of a cloned mammal in which most or all of the cells have 
the genetic mutation(s). 

Accordingly, in one such aspect, the invention provides a method of 

20 producing a genetically modified non-human mammal (e.g., a mouse) that 
involves inserting a cell produced by a method of the invention into a non- 
human.embryo under c ondi tions that allo w t he em bryo to develop into a fetus. 

In a related aspect, the invention provides another method of producing 
a genetically modified non-human mammal (e.g., an ungulate). This method 

25 involves inserting a c^U produced by a method of the invention or a nucleus 
from the cell into an oocyte. The oocyte or an embryo formed from the oocyte 
is transferred into the uterus of a host mammal under conditions that allow the 
oocyte or the embryo to develop into a fetus. 
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In preferred embodiments of any of the above aspects for generating 
mammals, the fetus develops into a live offspring. In some embodiments, one 
or more cells are isolated for the fetus or live offspring, and one or more 
additionalTnutatioris are inlro^^ 
5 used to generate mammals with these additional modifications. The metiiods 
may also involve mating two of the offspring to generate a mammal with a 
homozygous mutatibn or a mammal with a mutation in two or more genes. 

In some embodiments, the donor cell has two recombinase signal 
sequences and recombination occurs between the recombinase signal sequences 

10 in cells of a predetermined cell type of the fetus or a live offspring formed from 
the fetus. For example, the donor cell may be genetically modified to encode a 
recombinase under the control of a promoter specific for the predetermined cell 
type such that the recombinase is only expressed and recombination only 
occurs in that cell type of the fetus or offspring. 

15 — The mammals produced by the above methods can be used in various 
screening assays to identify a candidate compound that modulates the 
expression of a nucleic acid or protein of interest or that is usefiil for the 
treatment or prevention of a disease. For example, a candidate compound can 
be administered to a mammal genetically modified to express a reporter gene 

20 under the control of a promoter of interest or to express a fusion protein that 
includes a detectable protein and a protein of interest. The expression levels of 
the reporter gene or fusion protein can be measured to determine if the 
candidate compoimd modulates their expression in vivo. Such conipounds may 
be usefiil for the treatment of a disease associated with a nucleic acid operably 

25 Hnked to the promoter of interest or associated with the protein of interest 

-Additionally^ genetically modified mammals having-a mutation associated with- 
a disease can be used as animal models for the study of the disease. 
Compounds can be administered to these animal models to determine whether 
they ameliorate or prevent a symptom or other physiological effect associated 

30 with the disease. 
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Accordingly, the invention provides a screening method for determining 
whether a candidate compoimd modulates the expression of a nucleic acid of 
interest. This method involves administering a candidate compound to a 
mammal (e.g., a mammal that has a genetic modification in a nucleic acid of 
5 interest or in a promoter operably linked to a nucleic acid of interest) produced 
by a method of the invention, and measuring the expression of a nucleic acid. 
The candidate compound is determined to modulate expression of the nucleic 
acid if the candidate compound causes a change in expression of the nucleic 
acid. In some embodiments, measuring the expression of the nucleic acid 

10 involves measuring the expression of an mRNA corresponding to the nucleic 
acid or the expression of a protein encoded by the nucleic acid. Preferably, the 
mammal is genetically modified to express a reporter gene operably linked to a 
promoter of interest (e.g., a promoter that modulates the expression of a nucleic 
acid associated with a disease). In other embodiments, the mammal is 

15 genetically modified to express a fusion protein tliat includes a detectable 
protein and a protein of interest or a fragment thereof. In desirable 
embodiments, the protein of interest is associated with a disease. Exemplary 
candidate compounds include, for example, proteins, sj^thesized organic 
molecules, naturally occurring organic molecules, nucleic acid molecules, and 

20 components thereof. In some embodiments, the candidate compoxmd is a 

member of a library of at least 5, 10, 25, 50, 100, 500, or more compounds that 
are simultaneously administered to the mammal. 

In another aspect, the invention features a screening method for 
determining whether a candidate compound is useful for the treatment, 

25 stabilization, or prevention of a disease, disorder, or condition. This method 
involves administering a candidate compoiind to a manmaal produced by a 
method of the invention, and measuring one or more symptoms associated with 
the disease, disorder, or condition. The candidate conapound is determined to 
be useful for the treatment, stabilization, or prevention of the disease, disorder, 

30 or condition if tiie candidate compound reduces, stabihzes, or prevents flie 

10 
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symptom. In desirable embodiments, the genetically modified cell used to 
generate the mammal has a mutation associated with a disease, and the 
mammal has one or rnore phenotypes or symptoms associated with the disease. 
In some embodiments, the mammal has a mutation, chromosomal deletion, or 
5 translocation associated with cancer. Exemplary candidate compounds include, 
for example, proteins, synthesized organic molecules, naturally occurring 
organic molecules, nucleic acid molecules, and components thereof. In some 
embodiments, the candidate compound is a member of a library of at least 5, 
10, 25, 50, 100, 500, or more compounds that are simultaneously administered 

10 to the mammal. 

The invention also provides a metho d fo r determining whether a nucleic 
acid of interest is associated with a disease, disorder, or condition. This 
method involves measuring one or more symptoms associated with the disease, 
disorder, or condition in a mammal that has a mutation in a nucleic acid of 

15 interest and that is produced by a method of the invention. The nucleic acid is 
determined to be associated with the disease, disorder, or condition if the 
symptom differs between the mammal and a control mammal without the 
mutation. In some embodiments, a mutation is introduced into an endogenous 
promoter to increase the expression of a nucleic acid of interest Exemplary 

20 nucleic acids include nucleic acids that are thought to promote cancer such as 
possible oncogenes; genes that enhance cell proliferation, invasion, or 
metastasis; genes that inWWt apoptosisj^a^^^ 

The genetically modified cells produced using the methods of the 
invention can also be administered to a naammal for the treatment, stabiUzation, 

25 or prevention of a disease associated with a deficiency of fimctional cells of a 
particular cdll-type or associated with a mutation that is not present in the 
genetically modified cells. For example, the cells can be modified to replace a 
nucleic acid sequence that is associated with a disease with a nucleic acid 
sequence that is not associated with a disease. 
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Accordingly, in one aspect, the invention provides a method of treating, 
stabilizing, or preventing a disease, disorder, or condition in a mammal (e.g., a 
human). This method involves administering one or more cells (e.g., purified 
or unpurified mammalianor-humanceUs) producedby amethodof the 
5 invention to a mammal in an amount sufficient to treat, stabilize, or prevent the 
disease, disorder, or condition. In some embodiments, the mammal has a 
mutation associated with a disease that is not present in the administered cells 
(e.g., cells that have been genetically modified to eliminate a mutation 
associated with a disease). Preferably, a disease-causing mutation in a 

10 regulatory region, promoter, untranslated region, or coding region of a gene in 
the cells is modified to replace the mutant sequence with a sequence that is not 
associated with the disease. Examples of mutations that may be rescued using 
these methods include dominant or recessive mutations (e.g., mutations in the 
cystic fibrosis gene or a gene encoding a human clotting factors such as any of 

15 factors rto-XIH) (Voet and Voety Bioeheroistiyv Joto^^ 

York, 1990). In other embodiments, the mammal has a deficiency in the 
number or activity of cells of a certain cell type. 

In some embodiments of the therapeutic methods of the invention, the 
transplanted cells are genetically modified to introduce a mutation in a 

20 promoter or regulatory region that increases the expression of an operably 
linked nucleic acid that encodes a protein that prevents or ameliorates cancer. 
Cancer related genes that inhibit cancer include, but are not limited to, tumor 
suppressor genes; genes that inhibit cell proliferation, invasion, or metastasis; 
genes that promote apoptosis; and anti-angiogenesis genes. Exemplary cancers 

25 include prostate cancers, breast cancers, ovarian cancers, pancreatic cancers, 
g£istHc cancersrbladder-cancers, salivaiy-gland carcinomas,- gastrointestinal 
cancers, lung cancers, colon cancers, melanomas, brain tumors, leukemias, 
lymphomas, and carcinomas. Benign tumors may also be treated or prevented 
using the methods and cells of the present invention. 

12 
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With respect to the therapeutic methods of the invention, it is not 
intended that the administration of genetically modified cells to a mammal be 
limited to a particular mode of administration, dosage, or frequency of dosing; 
the present invention contemplates all modes of administration, including 
5 intramuscular, intravenous, intraarticular, intralesional, subcutaneous, or any 
other route sufficient to provide a dose adequate to prevent or treat a disease. 
Preferably, the genetically modified cells are adn^nistered to the mammal from 
which the cells are obtained. Alternatively, tiie cdk may be obtained from a 
different donor mammal of tiie same or a different genus or species as the 

10 recipient mammal. Examples of preferred donor mammals include humans, 
cows, sheep, big-horn sheep, goats, buffalos, antelopes, oxen, horses, donkeys, 
mule, deer, elk, caribou, water buffalo, camels, llama, alpaca, rabbits, pigs, 
mice, rats, guinea pigs, hamsters, dogs, cats, and primates such as monkeys. 
The cells may be administered to the mammal in a single dose or multiple 

15 doses. When multiple doses are administered, the doses may be separated fiom 
one another by, for example, one week, one month, one year, or ten years. One 
or more immunosuppressive agents, such as cyclosporin, may be administered 
to inhibit rejection of the transplanted cells. It is to be understood that, for any 
particular subject, specific dosage regimes should be adjusted over time 

20 according to the individual need and the professional judgment of the person 
administering or supervising the administration of the compositions. If desired, 
conventional treatmeiits may be used in combination with the genetically 
modified cells of the present invention. 

In preferred embodiments of any of the various aspects of the invention, 

25 the genetically modified cell has a mutation that alters the expression level or 
activity of one or more mRNA or protein molecules by at least 2, 5, 10, or 20- 
fold, as measured using standard assays (see, for example, Ausubel et aly 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, 
2000). 



13 



wo 2004/013299 



PCT/US2003/024322 



The cells and oocytes used in these methods may be firom the same 
species, or they may be from different species or genuses. In preferred 
embodiments, the oocyte is an enucleated or nucleated non-himian oocj^e. In 
addition, the genomic DNA of the cloned embryo, fetus, or mammal is 
5 preferably substantially identical to that of the donor cell. In other 

embodiments, the donor cell is inserted into an embryo for the production of a 
chimeric embryo, fetus, or mammal containing a mixture of (i) cells with DNA 
substantially identical to that of the genetically modified donor cell and (ii) 
cells with DNA substantially identical to that of the naturally-occurring cells in 

10 the embryo. Preferred non-human mammals and preferred sources of cells 
include rodents, such as mice and rats. Examples of other preferred mammals 
and preferred sources of cells include cows, sheep, big-horn sheep, goats, 
buffalos, antelopes, oxen, hoi-ses, donkeys, mule, deer, elk, caribou, water 
buffalo, camels, llama, alpaca, rabbits, pigs, guinea pigs, hamsters, and 

15 primates such as monkeys. In some embodiments, the mammal is a murine or 
an ungulate such as a bovuie, ovine, porcine, or caprine. 

As used herein, by "artificial chromosome" is meant a chromosome or 
fragment thereof which has an artificial (i.e., non-naturally occurring or 
engineered) modification such as the addition of a selectable marker, the 

20 addition of a cloning site, the deletion of one or more nucleotides, the 

substitution of one or more nucleotides, or the like. By '^bacterial artificial 
chromosome (BAG)** is meant an artificial chromosome generated from a low 
copy plasmid capable of faithfiilly replicating large chromosomal segments. 
Similarly, by "yeast artificial chromosome (YAC)" is meant an artificial 

25 chromosome generated from one or more yeast chromosomes. By "P 1 -based 
artificial chromosome (PAC)" is meant an artificial chromosome generated 
from the latent replicon of phage PI . If desired, two or more artificial 
chromosomes can be introduced into a cell (e.g., a bacteria or yeast) 
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simultaneously or sequentially using standard methods (see, for example, 
Ausubel et aL, Current Protocols in Molecular Biology, John Wiley & Sons, 
New York, 2000). 

By "donor cell" is meant a cell-firom which genetic material-(e.g,, the 
5 nucleus or the entire cell) is inserted into an oocyte, embryo, or fetus for the 
generation of a mammal with a genetic modification. 

By "purified" is meant separated frpm other components that naturally 
accompany it. Typically, a factor or cell is substantially pure when it is at least 
50%, by weight, free firom proteins, antibodies, naturally-occurring organic 

10 molecules, and cells with which it is naturally associated. Preferably, the factor 
or cell is at least 75%, more preferably, at least 90%, and most preferably, at 
least 99%, by weight, pure. A substantially pure factor may be obtained by 
chemical synthesis, separation of the factor from natural sources, or production 
of the factor in a recombinant host cell that does not naturally produce the 

15 factor. Proteins, -vesicles,-organelles,-and cells may be-purijSed by one skilled - 
in the art using standard techniques such as those described by Ausubel et ah 
(Current Protocols in Molecular Biology, Jolui Wiley & Sons, New York, 
2000). The factor or cell is preferably at least 2, 5, or 10 times as pure as the 
starting material, as measured using polyacrylamide gel electrophoresis, 

20 coluimi chromatography, optical density, HPLC analysis, or western analysis 
(Ausubel et ai, supra). Preferred methods of purification include 
immunoprecipitation, column chromatography such as immunoaffinity 
chromatography, magnetic bead immunoaffinity purification, paiming witli a 
plate-bound antibody, and cell sorting. 

25 By "viable offspring" is meant a mammal that survives ex utero. 

Preferably,-the mammal is alive_for at least„one. second,-one.minute,_one.hour, 
one day, one week, one month, six months, or one year from the time it exits 
tiie maternal host The mammal does not require the circulatory system of an 
in utero environment for survival. 



wo 2004/013299 PCTAJS2003/024322 

By "embryo" or "embryonic" is meant a developing cell mass that has not 
implanted into the uterine membrane of a maternal host. Hence, the term 
"embryo" may refer to a fertilized oocyte; an ooc3rte containing a donor nucleus 
or cell; a pre-blastocyst stage developing cell mass; or any other developing 
5 cell mass that is at a stage of development prior to implantation into the uterine 
membrane of a naatemal host and prior to formation of a genital ridge. An 
embryo may represent multiple stages of cell development For example, a one 
cell embryo can be referred to as a zygote; a solid spherical mass of cells 
resulting from a cleaved embryo can. be referred to as a morula, and an embryo 

10 having a blastocoel can be referred to as a blastocyst. An "embryonic cell" is a 
cell isolated from or contained in an embryo. 

By "fetus" is meant a developing cell mass that has implanted into the , 
uterine membrane of a maternal host. A fetus may have defining features such 
as a genital ridge, which is easily identified by a person of ordinary skill in the 

15 art. A "fetal cell" ijS any cell isolated from or contained in a fetus. 

By **mutation" is meant an alteration in a naturally-occurring or 
reference nucleic acid sequence, such as an insertion, deletion, frameshift 
mutation, silent mutation, nonsense mutation, or missense mutation. 
Preferably, tlie amino acid sequence encoded by the nucleic aad'seqiience Eais " 

20 at least one amino acid alteration from a naturally-occurring sequence. 

By "substantially identical" is meant having a sequence that is at least 
60, 70, 80, 90, or 100% identical to that of another sequence. Sequence 
identity is typically measured using sequence analysis software with the default 
parameters specified therein (e.g., Sequence Analysis Software Package of the 

25 Genetics Computer Group, University of Wisconsin Biotechnology Center, 
1710 University Avenue, Madison, WI 53705). BLAST, BESTFIT, GAP, or 
PILEUP/PRETTYBOX programs). Such software matches identical or similar 
sequences by assigning degrees of homology to various substitutions, deletions, 
and/or other modifications. Conservative substitutions typically include 

30 - 
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substitutions within the following groups; glycine, alanine; valine, isoleucine, 
leucine; aspartic acid, glutamic acid, asparagine, glutamine; serine, threonine; 
lysme, arginine; and phenylalanine, tyrosine. 

A substentially identical sequence is also one that hybridizes with another 
5 sequence under low stringency or high stringency conditions. For example, 
stringent salt concentration will ordinarily be less than about 750 mM NaCl and 
75 mM trisodium citrate, preferably less than about 500 mM NaCl and 50 mM 
trisodium citrate, and most preferably less than about 250 mM NaCl and 25 
uiM trisodium citrate. Low stringency hybridization can be obtained in the 

10 absence of organic solvent, e.g., formamide, while high stringency 

hybridization can be obtained in the presence of at least about 35% formaniide, 
and most preferably at least about 50% formamide. Stringent temperature 
conditions will ordinarily include temperatures of at least about 30°C, more 
preferably of at least about 37^0, and most preferably of at least about 42°C. 

15 Varying additional parameters, such as hybridization time, the concentration of 
detergent, e.g., sodium dodecyl sulfate (SDS), and the inclusion or exclusion of 
carrier DNA, are well known to those skilled in the art. Various levels of 
stringency are accomplished by combining these various conditions as needed. 
In a preferried embodiment, hybridization will occur at 30°C in 750 mM NaCl, 

20 75 mM trisodium citrate, and 1% SDS. In a more preferred embodiment, 
hybridization will occur at 37°C in 500 mM NaCl, 50 mM trisodium citrate, 
1% SDS, 35% formamide, and 100 ^ig/ml denatured salmon sperm DNA 
(ssDNA). In a most preferred embodiment, hybridization will occur at 42°C in 
250 mM NaCl, 25 mM trisodium citrate, 1% SDS, 50% formamide, and 200 

25 |ig/ml ssDNA. Useful variations on these conditions will be readily apparent to 
those skilled in the art. 

For most applications, washing steps that follow hybridization will also' 
vary in stringency. Wash stringency conditions can be defined by salt 
concentration and by temperature. As above, wash stringency can be increased 

30 by decreasing salt concentration or by increasing temperature. For example. 
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Stringent salt concentration for the wash steps will preferably be less than about 
30 mM NaCl and 3 mM trisodium citrate, and most preferably less than about 
1 5 mM NaCl and 1 .5 mM tiisodium citrate. Stringent temperature conditions 
for the wash-steps will ordinaTily-iuclude-a-temperature of at-least about-25?C, - 
5 more preferably of at least about 42°C, and most preferably of at least about 
68*^C. In a preferred embodiment, wash steps will occur at 25°C in 30 mM 
NaCl, 3 mM trisodium citrate, and 0.1% SDS, In a more preferred 
embodiment, wash steps will occur at 42°C in 15 mM NaCI, 1.5 mM trisodium 
citrate, and 0.1% SDS. In a.most preferred embodiment, wash steps will occur 

10 at 68^C in 15 mM NaCl, L5 mM trisodium citrate, and 0.1% SDS. Additional 
variations on these conditions will be readily apparent to those skilled in the 
art. Hybridization techniques are well known to those skilled in the art and are 
described, for example, in Benton and Davis {Science 196:180, 1977); 
Grunstein and Hogness {Proc. Natl Acad. ScL, USA 72:3961, 1975); Ausubel 

45 Qt-al. (Current Protocols m-MoleculaKBiology^^^ 

2001); Berger and Kimmel {Guide to Molecular Cloning Techniques^ 1987, 
Academic Press, New York); and Sambrook et al.. Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, New York. 

By "treating, stabilizing, or preventing a disease, disorder, or condition" 

20 is meant preventing or delaying an initial or subsequent occurrence of a 
disease, disorder, or condition; increasing the disease-free survival time 
between the disappearance of a condition and its reoccurrence; stabilizing or 
reducing an adverse symptom associated with a condition; or inhibiting or 
stabilizing the progression of a condition. Preferably, at least 20, 40, 60, 80, 

25 90, or 95% of the treated subjects have a complete remission in which all 

evidence of the disease disappears. In _another_pr(Bferred embpdimeirt^ . 
length of time a patient survives after being diagnosed with a condition and 
treated with a cell of the invention is at least 20, 40, 60, 80, 100, 200, or even 
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500% greater than (i) the average amount of time an untreated patient survives , 
or (ii) the average amount of time a patient treated with another therapy 
survives. 

By "treating, stabilizing, or preventing cancer'* is meant causing a 
5 reduction in the size of a tumor, slowing or preventing an increase in the size of 
a tumor, increasing the disease-firee survival time between the disappearance of 
a tumor and its reappearance, preventing an initial or subsequent occurrence of 
a tumor, or reducing or sfabilizihg an adw 

tumor. In one preferred embodiment, the percent of cancerous cells surviving 

10 the treatment is at least 20, 40, 60, 80, or 100% lower than the initial number of 
cancerous cells, as measured using any standard assay. Preferably, the 
decrease in the number of cancerous cells induced by administration of a cell of 
the invention is at least 2, 5, 10, 20, or 50-fold greater than tlie decrease in tlie 
number of non-cancerous cells. In yet another preferred embodiment, the 

15 number of cancerous cells present after administration of a cell is at least 2, 5, 
10, 20, or 50- fold lower than the number of cancerous cells present after 
administration of a vehicle control. Preferably, the metliods of the present 
invention result in a decrease of 20, 40, 60, 80, or 100% in the size of a tumor 
as determined using standard me^^^ 40, 60, 80, 90,~ 

20 or 95% of the treated subjects have a complete remission in which all evidence 
of the cancer disappears. Preferably, the cancer does not reappear or reappears 
after at least 5, 10, 15, or 20 years. In another prefeixed embodiment, the 
length of time a patient survives after being diagnosed with cancer and treated . 
with a cell of the invention is at least 20, 40, 60, 80, 100, 200, or even 500% 

25 greater than (i) the average amount of time an untreated patient survives or (ii) 
the average amount of time a patient treated with another therapy survives. 

The present invention provides a number of advantages related to the 
cloning of mammals. For example, the methods decrease the time and labor 
required to generate genetically modified donor cells which can be used in 

30 standard methods to generatedoned or chimeric mammals for medical, 
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research, or agricultural applications. In particular, only small regions of 
homology from a nucleic acid of interest (e.g., regions that are only 50 
nucleotides in length) need to be included in the linear DNA molecule for this 
molecule to homologously recombine with an artificial chromosome to 
5 generate a modified an artificial chromosome with a genetic modification of ' 
interest The presence of a large genomic insert in tixe artificial chromosome, 
which is homologous (e.g., at least 90, 92, 96, 98, or 100% identical) to a 
region of an endogenous cliromosome in a cell to be modified, significanfly 
increases the efficiency of homologous recombination between the modified 

10 artificial chromosome and the endogenous chromosome and thus reduces the 
number of transfected cells that need to be screened for the desired homologous 
recombination event. Due to the relative ease of producing these donor cells, a 
large number of mammals witiii different mutations can be generated 
simultaneously. These methods can be generally applied to inti'oduce any 

15 desired genetic modification, such as a knockout modification, conditional 
knockout modification, knock-in modification, chromosomal deletion, or 
chromosomal translocation. 

Cells that are modified to correct an undesired mutation (e.g., a mutation 
associated with a disease) can also be rapidly generated for clinical 

20 applications. Because the sequence of an endogenoiis nucleic acid is modified 
in the cells, these cell transplantation methods may be more effective for long- 
temi expression of the modified nucleic acid than conventional gene therapy 
techniques, which may be limited by positional ejffects due to site of integration 
of a transgene. Moreover, cells that are transplanted into the mammal from 

25 which the donor cells were obtained are unlikely to express foreign antigens 
and thus are. unlikely to induce an adverse inunune response that results in 
rejection of the transplanted cells. 

Other features and advantages of the invention will be apparent from the 
following detailed description and from the claims. 

20 
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Brief Description of tlie Drawings 
The application file contains drawings executed in color (Figs. IB, IC, 
IE, 4C, 5B, 6B, and 17B). Copies of this patent or patent application with 
color drawings will be jorovided by the. Office upon request and pajonent of the 
5 necessary fee. 

Figure 1 A is a schematic diagram of a traditional method for generating 
knockout mice. Figures IB and IC are schematic diagrams of vectors pSKY 
and pBAD?^redap. Figures ID and IE are schematic diagrams of an 
exemplary procedure for generating a modified BAG. Step 1 involves 
1 0 transforming the pBADXredaP plasmid into BAG host strain and inducing the 
expression of ap genes in the process of making electroporation competent 
ceils. Step 2 involves electroporating the linear substrate and selecting for 
zeocin/chloraniphenicol double resistant colonies. Step 3 involves identifying 
homologous recombination events by nested PGR, and step 4 involves 

-15 exti-acting- DNA. -Step-5 involves electroporating-the mixed-DNA into DH-1 OB 

cells and selecting for chloramphenicol resistant and ampicillin sensitive 
colonies. 

Figure 2 A is a schematic diagram of the generation of a faficg/xrcc9 
knockout construct that illustrates the stioictures of the wild-type gene (WT), 

20 the lineal- substrate (TG), and the mutant gene (MT). The black blocks 

represent homologous sequences. The open blocks represent exons. Figure 2B 
are pictures of gels of nested PGR reaction mixtures that show correct 
integration at 5 'and 3' sites and deletion of exons 5 and 6, Figure 2C is a 
picture of a gel of a restriction digestion of DNA from wild-type (WT) and two 

25 modified BACs (1 & 2) by Spel and EcoRV. Figure 2D is a picture of a DNA 
blot pf the gel in Fig, 2C_ with the probe, indicated in Fig..2A. 

Figure 3 A is a schematic diagram of the generation of a tiap/survine 
conditional knockout construct that shows the structures of the wild-type gene 
(WT), intermediate recombinants (Steps 1 and 2), and the final product (Step 

21 
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3). Open blocks represent exons. Figure 3B is a picture of a gel verifying , 
homologous recombination by nested PGR. The primers used in each step are 
indicated on the right. 

Figures 4A-4C are schematic diagrams of strategies for screening 
5 targeted ES cells. Homologous recombination can be identified by genomic 
PGR and DNA blotting when using a conventional targeting construct (Fig. 
4A), or by competitive PGR and FISH when using an intact BAG (Figs. 4B and 
4C). Gray blocks represent small homologous regions. Broken gray blocks 
represent large homologous regions. Black blocks represent BAG vector 
10 sequences. Downward anows indicate restriction endonuclease cleavage sites. 
"WT" denotes the wild-type locus; "TG" denotes the targeting construct; and 
"MT" denotes the mutant locus. 

Figures 5A and 5B illustrate the identification offancg/xrccP-taTgctcd 
ES cells. Fig. 5 A is a picture of a gel showing tibte PGR analysis of genomic 
15 DNA extracted from clones G38, C52, and C68 and from untargeted ES cells. 
BAG plasmid DNA (BAG) serves as positive control. PGR fragments from the 
5' and 3' ends of the BAG vector and the internal controls are indicated on the 
left. Fig. 5B is a picture of the FISH analysis of cell lines in Fig. 5A. 

Figures 6A and 6B illustrate the identification of t/o^3A/o^-i?-targeted 
20 ES cells. Fig. 6A is a picture of a gel showing the PGR analysis of genomic 
DNA extracted from indicated clones and untargeted ES cells. Fig. 6B is a 
picture of the FISH analysis of cell lines in Fig. 6A. 

Figures 7A and 7B illustrate the generation of knockout mice. Fig. 7A 
is a picture of gels showing the genotyping of mouse tail genomic DNA by 
25 PGR with primers ampHfying wild-type (WT) locus and mutant (MT) locus. 
Fig. 7B is a picture of gels showing flie RT-PCR analysis of mRNA extracted 
firom mouse spleenocytes with primers amplifying exons 5 and 6 fi:om 
fancg/xrccP, exon 2 from dok3/dok-L, and exon 2 firom iKprasl to detect the 
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presence of wild-type gene transcripts. Murine P2 microglobulin (for 
fancg/xrcc9) and caspase 8 noRNAs (for dokS/dok-L and iKPrasI) were used as 
internal controls. 

Figure 8 A is a schematic diagram of the generation of a 
5 /Ar/Sra57knockout construct that illustrates the structures of the wild-type gene 
(WT), the linear substiate (TG), and the mutant gene (MT). The homologous 
sequences are labeled as 5' arm and 3' arm. The hatched arrows are FRT sites. 
The open blocks represent exons. Figure 8B is a picture of a gel of nested PGR 
reaction mixtures that show correct integration at of the 5'and 3'ends of the 
10 antibiotic resistance cassette. Figure 8C is a picture of a gel of a restriction 
digestion of DNA from wild- type (WT)_and ruodifi^^^ Figure SD is a 

picture of a DNA blot of the gel in Fig. 8C with the probes indicated in Fig. 
8A. 

Figure 9 A is a schematic diagram of the generation of a dokS knockout 
15 construct that illustrates the structures of the wild-type gene (WT), the linear 
substrate (TG), and the mutant gene (MT). The homologous sequences are 
labeled as 5' arm and V arm. The hatched arrows are FRT sites. Tlie open 
blocks represent exons. Figure 9B is a picture of a gel of nested PGR reaction 
mixtures that show correct integration at of ttie 5'and 3 ' ends of the antibiotic 
20 resistance cassette. Figure 9C is a picture of a gel of a restriction digestion of 
DNA from wild-type (WT) and modified BACs. Figure 9D is a picture of a , 
DNA blot of the gel in Fig. 9C with the^probe indicated in Fig. 9A. 

Figure 10 is a table illustrating the modification of BACs by 
homologous recombination in E. colL 
25 Figures 1 1 A and 1 IB illustrate the generation of the fancg/xrcc9 

knockout construct The structures of the wild type gene (WT) and the mutant 
gene (MT) are depicted in Fig 11 A. Gray blocks represent homologous 
sequences; open blocks represent exons. Black arrows are Spel sites; open 
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arrows are EcdKV sites. PCR results using primers indicated in Fig. 1 1 A show 
correct integration at the 5' and the 3' sites and deletion of exons 5 and 6 (Fig 
IIB). 

FigureS"l-2A-l-2G illustrate the targeting of ES cells by intact B ACsr 
5 Fig. 12A depicts the screening for homologous recombination by PCR and 
FISH. Broken gray blocks represent laige homologous regions; black blocks 
represent BAG vector sequences. "WT" denotes the wild type locus; "MT" 
denotes the mutant locus. PCR analysis of genomic DNA extracted from 
FANCG/XRCC9 targeted ES clones C38, C52, C68, and untargeted ES cells is 

10 shown in Fig. 12B. B AC DNA was used as positive control. PCR fragments 
from the 5' and the 3' of BAC vector and the internal controls are indicated on 
the left. Blot hybridization of genomic DNA extracted from indicated clones 
and untargeted ES cells using probes indicated in Fig. 12A and a probe specific 
for zeo^/neo^ cassette is shown in Fig. 12C. 

15 Figures-13A-13G illustrate the generation of knockout mice; " 

Genotyping of mouse tail genomic DNA by PGR with primers amplifying the 
wild type (WT) locus and the mutant (MT) locus is shown in Fig. 13 A. RT- 
PCR analysis of mRNA extracted from mouse spleenocytes with primers 
amplifying wild type gene transcripts and primers amplifying coding regions of 

20 either P2 microbulin (300 bp) or caspase-8 (500 bp) mRNAs as internal 

controls is shown in Fig. 13B. Analysis of genomic markers within 100 kb of 
the targeting site is depicted in 13C. Distance of the markers from the targeting 
site and the sizes of the PCR fragments are indicated Some markers (M9 for 
fancg/xrcc9 locus and M3 for iKpras-1 locus) are absent in corresponding 

25 BACs, suggesting that these markers are outside of the genomic inserts of tiie 
-BAGs. . _ . 

Figure 14 is a table sunmiariziag gene-targeting efficiency. 
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Figure 1 5 illustrates a genomic PGR genotyping of Co^/c-deficient BS • 
clones. Clone 3 1 shows both the wild type locus (WT) and the BAG (MT) 
smce it is the result of non-homologous integration of the BAG containing the 
modified cask locus. Clones 38, 49, and 57 are the result of homologous 
5 recombination events and only contain the modified locus (MT). 

Figure 16 illustrates a genomic PGR genotyping of magiun-l 'deficient 
ES clones. Clone 51 is the result of non-homologous integration of the BAG 
containing the modified cask locus and therefore shows both the wild type 
locus (WT) and the BAG (MT). Clone 50 is the result of homologous 
10 recombination events and only contains the modified locus (MT). 

Figuires 17A-i7C illustrate the generation of conditional tiap knockout 
mice. Figure 17A is a schematic illustrating the sequential insertion of FRT 
sites into designated int ons. Figure 17B illustrates a FISH analysis of three ES 
clones. Clones A8 and B6 are the result of homologous recombination events 
15 and Clone B4 is the result of a random integration event. Figure 17C 

illustratres that all the FRT sites have been faithfully integrated into the ES 
genome. . 



' " "Detailed Description " " ~ - - — 
20 To overcome difficulties associated with conventional site-specific 

modification methods, we have developed a method that uses modified 
artificial chromosonies (e.g., B AGs, PAGs, or YACs) that contain a genetic 
modification of interest as targeting constructs. The large amount of sequence . 
that is homologous between the artificial chromosome and an endogenous 
25 chromosome of the targeted cells greatly increases the efficiency of 

homologous recombination between the region of the artificial chromosome 
containing the genetic modification of interest and the corresponding region of 
the endogenous chromosome. For example, murine BACs contain 100-150 
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kilobases of mouse geaomic sequence that is substantially identical to an 
endogenous chromosome of a targeted mouse cell, such as an embryonic stem 
(ES) cell. 

For these methods, an artificial chromosome containing a genomic insert 
5 (e.g., a region of a mouse chromosome) with a nucleic acid of interest is 

obtained from a commercial source or constructed using standard methods. A 
desired genetic modification is introduced into the nucleic acid of interest in the 
artificial chromosome by homologous recombination between the artificial 
chromosome and a linear DNA molecule containing a positive selection marker 

10 (e.g., a zeo^/neo^ marker) flanked by two regions of homology to the nucleic 
acid of interest. In some cases, the desired modification is tiie insertion of the 
positive selection marker into the nucleic acid of interest in die artificial 
chromosome by homologous recombination and the inactivation of the nucleic 
acid by this insertion. In other cases, one or both regions of homology in the 

1 5 linear DNAmolecule have a desired mutation that is introduced into the 
nucleic acid of interest in die artificial chromosome. If the aitificial 
chromosome is a BAG or PAC, the linear DNA molecule and either the BAC 
or PAC is introduced into bacteria to allow this homologous recombination to 
occur. If the artificial chromosome is a YAC, then the homologous 

20 recombination is performed in yeast. This method requires very little sequence 
information about the nucleic acid of interest. For example, homologous 
regions in the linear DNA firagment as short as 50 nucleotides have been shown 
to recombine with the corresponding regions in the artificial chromosome to 
generate the desired modified artificial chromosome. The resulting modified 

25 artificial chromosome is introduced into one or more cells (e.g., mouse ES 
cells), and the genetically modified cells in which homologous recombination 
occurs between the artificial chromosome and corresponding region of an ^ 
endogenous chromosome are selected. 
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Genomic PGR and blotting hybridization are two techniques that have 
been used in tiaditional methods to identify homologous recombination in ES 
cells. Genomic PGR is xisually only practical when the flanking region is less 
then-10 kilobases (Fig. 4A)r DNA blot analysis is more reliable but is also 
5 limited by the availability of appropriate restriction sites and gel resolution 
when very large fragments are introduced. Hence, neither method can be 
applied in general when an entire artificial chromosome is used as a targeting 
construct in the present methods. 

Thus, to use an artificial chromosome (e.g., BAG) for accurate gene 

10 replacement in ES or other donor cells, we had to find an effective method to 
distinguish homologous recombination from random insertion. We solved this 
problem by determining copy numbers of the target gene locus using 
fluorescence in sitii hybridization (FISH) with the entire artificial chromosome 
(e.g., BAG) as the probe. This approach allows one to visually distinguish 

15 random insertion-events,- which generate a locus for hybridization, from desired 
orthotopic targeting events, which maintain the two locus signature of the 
normal diploid cell. We have observed similar FISH resolution from 
interphase nuclei and from metaphase chromosome spreads. 

If desired, to reduce the number of FISH experiments that are 

20 performed, competitive PGR can be used to detect the vector sequences 

attached to both ends of the artificial chromosome (e.g., BAG) as a primary 
screening method to exclude random integration events. We found that most of 
the BAGs remained intact after electroporation into ES cells. In particular, 
50% of the G418 resistant colonies usually contained at least one vector end 

25 sequence. These clones all showed an extra signal when tested by FISH 
indicating that random integration ha d oc curred. Because &e vec^^^ 
attached to both ends of the BAG are very small and probably axe the first to be 
lost dming degradation, as many as 80% of the clones passing the initial PGR 
screening test can be false positives when detected by FISH. Nonetheless, the 

30 frequency of targeting is very high, and the procedure is quite simple. 
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An exemplary method for identifying correct targeting events using ^ 
PGR and FISH is illustrated in Fig. 12A. The BAG is linearized so that 
flanking vector arms remain on either side of the insert. G41 8-resistant 
colonies are first screened for the presence of these arms by PGR, and any 
5 colonies containing either of the vector segments are discarded. The remaining 
colonies are then screened by FISH. If the mutant BAG replaces one of the 
wild type gene loci, there should be no net gain in gene copy nimiber detected 
by FISH using the BAG as probe. Alternatively, undesired/randbm mtegration 
should result in the appearance of an additional hybridization signal. 

1 0 Independent knockout mice strains have been derived in which the ES 

clones passed the FISH screening test, suggesting the generality of the present 
strategy. In all cases, several positive clones were obtained from only one 96- 
well plate of colonies, indicating consistent high homologous recombination 
efficiency in ES cells, probably due to the extensive homology shared between 

15 the BAG and the targeted gene locus. We were able to obtain correctly 
targeted ES clones for four different genes in less than two months. 
Additionally, less than eight months was required to construct a Fanconi 
Anemia protein G (FANCG)-containing BAG and to obtain homozygous fancg 
"Imbckdut mice:"~FoOT^^ pdg/cbp^ and tab2 

20 have also been generated. All of the above five mutant lines have been bred to 
homozygosity by the above approach. In addition, two X-chromosomal genes, 
cask and maguin-1 have been targeted and since there is only one copy of them 
in ES cells, the mutant ES cells are knockout lines as judged by genomic PGR 
(see Figures 15 and 16). 

25 In summary, we have developed a method for rapidly generating 

genetically modified mammals. It involves two steps: modifying artificial 
chromosomes in bacteria or yeast via homologous recombination and using the 
mutant artificial chromosomes to modify ES cells or other cells. We have 
simplified the identification of orthotopic integration by using FISH with tiie 

30 entire BAG as the probe, To reduce the number of FISH experiments, we used 
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PGR to detect the vector sequences attached to the ends of the BAG as a 
primary screening method. The system can be used to modify BAGs or Pis 
without the need for special strains or libraries. In fact, the system can be used 
to modify aiiy artificial chromosome without the need for special strains or 
5 libraries (Yu et aL, Proc. Natl. Acad. Sci. USA 97:5978-83, 2000 and Zhang ' 
et al, Nat. Genet. 30:31-9, 2002). Competitive PGR and FISH are sufficient to 
screen for homologous recombination events in the cells. The procedure can 
be sheamlined and automated for high-thi-oughput site-specific knockouts 
production. In principle, it allows several genes to be inactivated 

10 simultaneously, which is especially attractive for knocking out families of 

functionally redundant genes. These methods also provide a simpler and easier 
solution for more sophisticated genomic manipulations, such as conditional 
knockouts, knock-ins, and large-scale chromosomal engineering (Ramirez- 
Solis et al. Nature 378:720-724, 1995 and Smith et aL, Oncogene 2002. 

15 21:4521-4529,1995). 

These methods are described further below. 

Overview of Present Methods for Generating Genetically Modified Mice 

As described above, the present method includes two steps, the 
20 modification of an artificial chromosome (e.g., a BAC, PAC, or YAC) in 
bacteria or yeast and gene targeting in the cells to be modified. Tlie general 
procedure for generating a modified BAG is illustrated in Figs. ID and IE; 
similar methods can also be used to generate other modified artificial 
chromosomes, such as PACs or YACs. In particular, a linear DNA fragment 
25 that has an antibiotic resistance gene flanked by two regions of homology to a 
nucleic acid, of interest is transformed into the BAC host. A plasmid that (i) 
encodes enzymes that mediate homologous recombination between a short 
linear DNA fragment and a circular plasmid and that (ii) contains a different 
antibiotic resistance gene than the linear DNA molecule is also transformed 
30 into the BAC host, and the expression of thie en2ymes is induced. Double 
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resistant colonies are screened for homologous recombination events by nested 
PCR. The DNA is extracted from these colonies and used to transform other 
bacteria. Cells containing the BAG but not tlie plasmid encoding the 
recombinases are selected based on the antibiotic resistance of the transformed 
5 cells. Correct targeting events are further confibcmed, if desired, by comparing 
restriction endonuclease digestion and DNA blotting. The resulting modified 
BAC is isolated, linearized, and transformed into the cells (e.g., ES cells) to be 
modified. 

Correct targeting is identified using PCR and fluorescence in situ 
10 hybridization. The principle is illustrated in Figs. 4B and 4C. When the 

modified BAC is linearized and electroporated into cells (e.g., ES cells), there 
are two short fragments from the vector attached to both ends. Homologous 
recombination results in the loss of these sequences, whereas random ^ 
integration often keeps them intact with the transgeue (Figs. 4B and 4C). 
...15 _ _Qenomic DNAis.extracted.from resistantxolonies.(i.e.,xoloniesxontaining the- 
antibiotic resistant gene from the modified BAC) and subjected to competitive 
PCR using primers specifically amplifying these fragments along with internal 
conti'ols primers. Colonies containing either of the vector fragments are 
discarded (Figs. 4B and 4C). 
20 Although competitive PCR can exclude random integration events, 

whether the remaining colonies have undergone homologous recombination is 
confirmed by FISH. If the modified BAC replaces one of the wild-type gene 
loci as desired, there should be no net gain in gene copy number detected by 
FISH using the BAC as probe. On the contrary, random integration results in 
25 an additional hybridization spot (Figs. 4B and 4C). 

Generating a targeting construct by modifying a BAC in E. coli 

For the generation of a linear DNA fragment containing an antibiotic 
resistance cassette flanked by two regions of homology, a plasmid designated 
30 pSKY was designed to contain a FRT site-flanked dual selection cassette 
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which includes an E7 promoter-driven zeocin resistant gene {zed) for positive^ 
selection in bacteria and a phosphoglycerate kinase (PGK) promoter-driven 
tied' for positive selection in ES cells. 

In particular, pSKY was constructed using a minimum backbone 
5 containing the ColEl replicon and the ampicillin resistant gene &om 
pBluescript (Stratagene). An Klzeo^ cassette was PCR amplified from 
pEM7/Zeo (Invitrogen) and ligated 5' to a PGKneo' cassette firom pGT-N28 
(NEB) with the" PGK polyX signal replaced wth the SV40 polyA signal and 
then flanked by FRT sites. Translation stop codons in three reading frames 

10 were inserted 5' to the Wzed cassette to terminate any potential translations 
from upstream. Convenient cloning sites were generated at both 5' and 3' 
ends. If desired, the zed'/ned' cassette can be removed through recombining : 
the flanking FRT sites by the site-specific recombinase Flp (Zhang et aL^ 
Nature Genet 20:123-128, 1998 and Muyrers et al. Nucleic Acids Res. 

15 27:1555-1557, 1999). 

A coding region of the gene of interest is identified by PCR 
amplification of mouse genomic DNA. This region can be as short as, e.g., 130 
base pairs. The primer pair amplifying this region (or the PCR product) is used 
to obtain a mouse genomic BAC clone containing the gene tto^ ** 

20 screening service provided by GenomeSystems (now a subsidiary of Incyte 
Pharmaceuticals). Then, a linear DNA fragment is constructed such that two 
short regions of homologous sequence from the identified coding region are 
placed on either side of the zed'/ned' cassette in plasmid pSKY (Fig, IB). In 
particular, this DNA fragment can be genemted by cloning the two regions into 

25 the 5 ' and 3 ' MCS sites, respectively, in vector pSKY followed by 

Notl/Hindin digestion. For example, homologous DNA fragments were either 
generated by PCR amplification or by synthesis and cloned into the 5' 
EcoRV/EcoRI sites and the 3' BamHI/Hindlll sites of pSKY. After 
sequencing confirmation, the plasnoids were digested with Nofl/Hindlll, and 

30 tiie recombination substrates were gel-purified using Qiagen gel extraction kit 
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(Qiagen). Alternatively, the two homologous regions can be tailed to primers 
amplifying the zeo^/mo^ cassette, and the linear fragment can he obtained by 
fusion PGR. The homologous regions can be as short as, e.g., 50 nucleotides 
and can reside on different exons so that the sequences between them are 
5 deleted. We have not reached a lower limit on the possible lengths of the ' 
regions of homology or an upper limit on the possible lengths separating the 
two regions. 

Because the DHIOB bacteria strain hosting flie BACs are recombination 
deficient, two shuttle vectors, pBADXredap and pBAD?ireda|iy , were 

10 constructed to express a partial or full-length phage Xred gene under the control 
of arabinose-inducible pBAD promoter (Fig. IC). The \ red gene encodes 
three subunits and can mediate homologous recombination between a short 
linear DNA fragment and a circular plasmid (Zhang et al.^ supra, Muyrers et 
al, supra, Poteete et al, Virology 134:161-167, 1984). Plasmid pBADA^redaP 

15 was also constructed using a minimum backbone containing the ColEl replicon 
and the ampicillin resistant gene from pBluescript (Stratagene). The X reda^ 
gene was PGR amplified from phage X DNA and cloned under the pBAD 
promoter along with the araC regulatory element from pBAD/His A 
(Invitrogen). 

20 High homologous recombination efficiency was achieved without 

utilizing the Xredy gene. The Xredy gene encodes Gam, which inhibits the 
exprmclease activity^of the RecB.CD complex in Kcpli and.prevents Ae Im 
DNA fragment from degradation. It is possible that liie endogenous RecBCD 
exonuclease does not degrade the linear DNA substrate as quickly as 

25 previously believed. Others have previously reported that gain or inhibition of 
recBC is noi required for recombination of linear DNA fragments with the E, 
coll chromosome. An alternative explanation is that its activity is inhibited in 
the process of making electroporation-competent cells (Karoui et al.^ Nucleic 
Acids Res. 27:1296-1299, 1999). 
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When the entire gene including the y subunit was used, we 
obtained 50 fold more double resistant colonies; nevertheless, the homologous 
recombination efficiency dropped 4 fold. Whether Xrerfy also promotes 
random insertion js-not-clear,-but high-fidelity-is desirable-when manipulating - 
5 BAC-sized firagments of DNA because of flie need to identify and eliminate 
clones with unwanted gene rearrangements that occurred during targeting. 

The pBADXred shuttle plasmid is transformed into BAC-containing 
DHIOB cells and bacteria resistant to both ampiciUin and chloramphenicol are 
made electroporation-competent after arabinose induction (Figs. ID and IE). 

10 The linear DNA fragment described above is electroporated into these cells, 
and its integration into the BAC renders the cells resistant to zeocin and 
chloramphenicol (Fig. IE). The double resistant colonies are screened for 
correct targeting events by nested PGR using two primers annealing outside the 
homology regions (P3, P4) paired with two primers annealing witliin the 
--15 zeo7neo'*cassetteXP-lvP2)._APCR-product,of.thexorrect.lengththatis_ , . 
amplified by primers PI and P3 indicates homologous recombination at the 5' 
end. Similarly, homologous recombination at the 3' end is verified by PGR 
using primers P2 and P4. The correctly modified BAC is separated firom the 
pBADA,redaP shuttle plasmid by electroporating the mixed DNA into DHIOB 

20 cells and selecting for colonies resistant to chloramphenical, but sensitive to 
ampiciUin (Fig. ID). 

To determine whether there are any random rearrangements in addition 
to the desired targeting, the digestion patterns of the wild-type BAG plasmid 
and the modified BAC plasmid, which contains additional restriction sites due 

25 to introduction of the Zed'/ned cassette, are compared (Fig. IE). Incorporation 
of flie antibiotic.resistance cassette can be. fiirtherjconfirmedi, if desired, by 
DNA Blotting using either the 5* or 3' homology region as a probe. 
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Targeting of the wild-type gene locus in ES cells using the modified BAC . 

For the genetic modification of ES cells, the modified BAC DNA is 
digested by NotI, and the genomic insert with two regions of BAC vector 
sequence left on botli ends is electroporated into the cells. Then, G41 8- 
5 resistant colonies are selected. Genomic DNA of these colonies is extracted 
and first screened by competitive PGR. Two pairs of primers are designed to 
amplify the BAC vector DNA left on either end of the linear firagment (Pa & 
Pb, Pc & Pd7-Figsr4B and 4e) . Ifhomologous recombination occurs,'thjese 
two regions of BAC DNA are eliminated and not amplified. If random 

10 integration occurs, these two regions of BAC DNA remain next to the genomic 
insert and are amplified by the BAC specific primers. Colonies whose 
genomic DNA contains the BAC vector specific fixigments are discarded. A 
pair of primers amplifying a larger fragment of mouse caspase-8 gene is used 
as an internal control in the same PGR reaction. 

15 DNA samples which do not contain the BAC vector specific sequences 

are subject to DNA Blotting using the same restriction endonuleases and either 
the 5' or 3' region of homology as a probe (Figs. 4B and 4C). Although this 
step cannot distinguish homologous recombination from random integration, 
this step is used to eliminate any clones with-ineorreet digestion patterns (Figs. 

20 4B and 4C). This step can also be used to select the restriction endonucleases 
and probes which are later used to identify homozygous mutant mice. 

A few of the remaining ES clones are analyzed using fluorescent in situ 
hybridization (FISET) with the whole BAC clone as a probe. If the right loctxs is 
targeted, the probe detects two copies of the BAC inserts in the metaphase 

25 nucleus in the mutant cells as well as in wild-type, control cells in which no 
BAC was introduced or in which a control BAC without a knockout cassette 
was introduced. However, if random integration occurs, more than two copies 
are detected (Figs. 4B and 4C). ES clones passing all tliree tests are injected 
into mouse blastocyst to make chimeras. 
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We were not able to demonstrate that one of the gene loci harbors the 
1 .5 kUobase zed/neo^ positive selection marker from the knockout cassette 
using two-color FISH with the zed/ned marker as a second probe. We suspect 
that the zed'Ined' marker is too short to be used as a specific probe for FISH. 
5 Although the presence of this cassette was not confirmed by two-color FISH, 
the neomycin resistance of the cells with the inserted BAG indicates that this 
marker is present. Longer positive selection markers may allow two-color 
FISH to be used to detect the target gene locus and the positive selection 
marker simultaneously, if desired. 

10 

Generation of Several Modified BACs and a Modified PAC 

To demonstrate that the present methods can be used to modify a variety 
of nucleic acids, a BAG containing a gene with a known genomic structure 
(e.g.,^w^:g) and several BACs containing genes with unknown genomic 

15 structures were modified with this procedure. As illustrated in Figs. 2A-2D, 
3 A, 3B, 8A-8D, and 9A-9D, primer Pzeo and Psv were used to anneal to the ' 
antibiotic resistance cassette. Primer Pzeo has the following sequence: 5'-GGC 
GAG GGT GTT GTG CGG GAG C-3'(SEQ ID NO: 1), and primer Psv has the 
following sequence: 5'-AAG GTT GGG GTT CGG AAT CG-3' (SEQ ID NO: 

20 2). Theprimersusedfor amplifying B AC backbone sequences include Pa: 5'- 
ACA GAT GGG TAA GGA GAA AAT AG-3' (SEQ ID NO: 3), Pb: 5'-CGC 
GGT ATA GTG AGT GGT ATT AC-3' (SEQ ID NO: 4), Pc: 5'-ATA GTG 
TCA GGT AAA TAG CTT GG-3' (SEQ ID NO: 5)," and Pd: 5'-GGG ACG 
ACA GGT TTC CCG AGT GG-3' (SEQ ID NO: 6). The primers used for 

25 targeting/ancg/xrccP include G34: 5'-CCG TCT TCC AGC GAG GGA GGG 
GG-3' (SEQ ID NO: 9), G35: 5'-GGC GAT ATC TGC CQT TGG TTC TAA- 
3XSEQ ID NO: 10), G36: S'-CCG AAT TCC AGG GTA GTG GAA A-3' . 
(SEQ ID NO: 1 1), G37: 5'-CCC GGA TCC CAC CTC CTC TCT AGG-3' 
(SEQ ID NO: 12), G39: 5'-GGC GAA GCT TTC TGA GCC TTT AGT-3' 

30 (SEQ ID NO: 13), G40: 5'-TGG CTA AAT TCA CTA AGT G-3' (SEQ ID 
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NO: 14), Gex5F: 5'-CCT CTG AGG ATC TGC TAG TAG TGC-3' (SEQ ID 
NO: 15), and Gex6R: 5'-GTG TAG ACC TGG ACT AAC AGG GAGr3' (SEQ 
ID NO: 16) (Yang et al. Blood 98:3435-3440, 2001). The primers used for 
targeting iJfcftras/ include Ikbras 1-1: 5--gggatggaagactgtgaaaogc-3' (SEQ ID 
5 NO: 17), Ikbrasl-2: 5'-gtctttgaacttatcaatctc-3' (SEQ ID NO: 18), Ikbrasl-3: 5'- 
aggtggcaatcgttgtgttagg-3' (SEQ ID NO: 19), and Ikbiasl-4: 5'- 
ctggatttgctctgggctgag-3' (SEQ ID NO: 20). The primers used for targeting 
dok3 inchide Dok3-l: 5*-gctggcgcaaagtgggctctg-3' (SEQ ID NO: 21), and 
Dok3-2: 5'-ccgggaaggccagctgacagatg-3' (SEQ ID NO: 22). The primers used 

10 for targeting tab2 include: Tab2-1 : 5'AATAACCTGGATGCGTGGTGCG3' 
(SEQ ID NO: 25), Tab2-2: 5'GAAGGTTGATAGGGTTGCTGAG3' (SEQ ID 
NO:26), Tab2-3: 5' TAATGCCATGAACCCTCAGGAAG3' (SEQ ID NO:27), 
and Tab2-4: 5' GGATGATCAGATCGGATACTGAG3' (SEQ ID NO:28). 
Primers for targeting pag/cbp include Pag-1 : 5 ' 

15 ATTCTTTCAGAAGAGAGCACGCTG3' (SEQ-ID-NO:-29), Pag-2: 5' - - 
GCGTGCACGGGTCCCTTCTGCAG3' (SEQ ID NO: 30), and Pag-3; 5' 
CGGGTCTTCTTCCCGAGACTTGTATG3' (SEQ ID NO: 31). Primers for 
targeting cask include Cask-1: 5'AGGTATTGGAAGAAATTTC3' (SEQ ID 
NO: 32), Cask-2: 5'GGCCTTACTTCAGACTCATG3' (SEQ ID NO: 33), and 

20 Cask-3:5'TTGGCTCATCCGTGTTCTTTTG3'(SEQIDNO:34). Primers 
for targeting maguin- 1 include maguin- 1 - 1 : 

5'AGGTCTTGATGACTGTTTGCAG3' (SEQ ED NO: 35), maguin-1-2: 
5'CAGTGCACACAAAAGGTCAACTG3' (SEQ ID NO: 36), maguin-1-3: 5' 
TTATGGCTTAGAAACAGAAATC3' (SEQ ID NO: 37), and maguin-1-4: 5' 

25 CCTGTCCAACCAAGCAAGAAGACTC3' (SEQ ID NO: 38). Primers for 
targeting cnk: cnkai5:ATGATGCCC:TACAGGACTATC3USE 
39), cnk-2: 5'AGGCAAAGCTGCAGGAACTG3' (SEQ ID NO: 40), and cnk- 
3: 5' AGGGACATCCCAGATCTGAG3' (SEQ ID NO: 41). Primers for 
targeting tim-3 include: Tim-3-1: 5'CTCTGGACTGCC ACTTTTAAAG3' 

30 (SEQ ID NQ: 42), Tim-3-2: 5'CCACAAACCTCTATG TCTCAAAG3* (SEQ 

36. 



wo 2004/013299 PCT/US2003/024322 

m NO: 43), and Tiin-3-3: 5*AGCCTTATTACACTGG CCAACTTG3' (SEQ< 
ID NO: 44). Primers for targeting tiap include PI: 
5'GTGGTGACGCCATCATGGGAGCTCCG3' (SEQ ID NO: 45), P2:5' 
CTCAGCTAATTATCGAG3' (SEQ ID NO:46), and P3: 5' CGGGTTGT 
5 CATCGGGTTCCCAG3' (SEQ ID NO:47). Contid primers include 5'-GAG 
GAG ATC TTT CCC TCA GGC^3'(SEQ ID NO: 23) and 5'-CAG AGG CTC 
TGA GTA AGA CC-3' (SEQ ID NO: 24). 

For the preparation of electroporation competent cells, a single BAG 
clone transformed with pBADXredaP was grown in 5 ml of LB medium (50 

10 |ag/ml ampicillin, 12.5 |ug/ml chloramphenicol) until reaching an ODeoo of 0.6 
and diluted into 500 ml of LB medium. The culture was induced with 0.1% L- 
Arabinose at an ODgoo of 0.2 and harvested when an OD^qq of 0.6 was reached. 
Cells were washed once each with 500 ml of ice-cold ddH20, 500 ml of ice- 
cold 0.4 mM HEPES (pH 7.5) solution, and 5 ml ice-cold 15% glycerol. Cell 

15 pellet was re-suspended in equal volimie of 10% glycerol and stored in — 80^C. 

For homologous recombination in E.coli, a linear DNA fragment (1 to 2 
^g) was electroporated into 100 ^il of BAG competent cells at 25 (iFD, 2.3 kV 
and 200 Q wifli a Bio-Rad Gene Pulser. Cells were immediately re-suspended 
in 1 ml of LB medium and inoculated at 3TC for 1.5 hours. Transformants 

20 were selected on LB plates containing 20 jig/ml zeocin (Invitrogen) and 12.5 
p,g/ml chloramphenicol (Sigma). Double resistant colonies were screened by 
standard colony PGR with primers to amplify both the wild-type and mutant 
locus. The reaction contains 0.2 mM dNTPs, 0.5 |iM of primers, and 1 unit of 
Taq polymerase (Roche) in 20 |li1 of total volume containing Ix reaction buffer 

25 with 1.5 mM MgCl2 (Roche). The cycling conditions were as follows: 94^C for 
3 minutes for 1 cycle; 94^C for 45 seconds, 55°C for 45 seconds, and 72°C for 
45 seconds for 30 cycles; 72°C for 5 minutes for 1 cycle. Reactions were 
performed on a Stratagene RoboCycler®. Standard restriction digestion and 
Southern Blotting were_performed.. 
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As summarized in Fig. 10, 10 different BACs were modified with an 
average efficiency of 60% (25% to 100%). The two homologous regions can 
reside, e.g., in one or two exons and can he as short as, e.g., 50 nucleotides. 
We have heen able to delete up to about 5 kilobases of sequence between the 
5 two homologous regions. If desired, even shorter regions of homology or even 
greater distances between the homologous regions may also be used. 

We have also successfully modified PI clones with this method. In 
addition, we have made conditional knockout constructs in three simple steps 
using a murine PI clone containing the tiap/survine gene. We first targeted the 
10 zeoVneo'' cassette into intron 1 (step 1, Fig. 3 A). We then transduced the 
homologous recombinants into a Flp expression strain, FLP/DKl. PI clones 
with the marker removed were isolated (step 2, Fig. 3 A). The zeoVneo'' cassette 
was re-introduced into intron 3 (step 3, Fig, 3 A). Products in each step were 
verified by nested PpR (Fig. 3B). 

15 

Genersition of fancg/xrcc9 Knockout Mice 

As described above, the zeoVneo'' cassette was introduced into a BAG 
clone containing the murine fancg/xrcc9 gene. The marker replaced exons 1 to 
10 with 480 nucleotides and 260 nucleotides of homologous sequences from 

20 exons 1 and 10, respectively (Figs. 2 A and 1 1 A). Two primers outside the 
homologous regions (G34 and G40) paired with two primers inside the 
zeo^/neo^ cassette (Pzeo and Psv) were used to screen double resistant colonies 
for correct targeting events at the 5' and 3' end, respectively, by nested PGR 
(Figs. 2B and 11 B). Among 24 colonies picked, all produced predicted PGR 

25 fi'agments at both ends. In addition, the PGR product ampHfied from exons 5 
and 6 was only observed in the wild type BAG, suggesting these exons were 
deleted in the modified BAGs (Fig. 2B). 

To determine whether any unintended rearrangements were generated 
during modification, the digestion patterns of EcoRV (i.e., a restriction site that 

30 is absent in the linear DNA fiagment) or Spel (i.e., a restriction site that is 
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unique in the linear DNA fragment) restriction sites of two independent 
modified B AGs were compared with those of the wild-type BAG. No gross 
rearrangements were detected in the modified BAGs. (Fig. 2C). DNA blotting 
using the 5 ' homologous region as probe demonstrated that there was a single 
5 copy of the marker integrated at the desired site (Fig. 2D). The fidelity of this 
procedure was flulher confirmed by shotgun sequencing of the modified BAG. 

This modified BAG was used to target the fancg/xrcc9 locus in ES cells. 
In particular, BAG DNA was purified using a Qiagen Large-Gonstmct kit, 
linearized by NotI digestion, extracted once using phenoliCHGls, precipitated, 

10 and resuspended in O.lx TE bufifer (pH 8.0) at 1 jag/jil. ES cells were cultured 
according to standard conditions (DePamphilis, ed. Methods in Enzymology, 
Vol. 225 Academic press, Inc., 1993), and 6x10^ ES cells were electroporated 
with 30 [ig of BAG DNA at 0.23 kV, 960 |iFD by a Bio-Rad Gene Pulser. 
Selection was started the second day after transfection with medium containing 

15 400 |Lig/ml 6418 (Invitiogen/Gibco). eolonies were picked into-96=^well plates. 
Genomic DNA was extracted according to standard protocols (DePanqphihs, 
supra). About 0.1 |ig of genomic DNA was used as templates in competitive 
genomic PGR. Cycling conditions were the same as described above. The 
control primers for competitive PGR include forward: 5 '-GAG GAG ATG TTT 

20 GCC TGA GGC-3 ' (SEQ ID NO: 7), and reverse: 5'-GAG AGG GTG TGA 
GTA AGA GG-3* (SEQ ID NO: 8). 

Among 96 PGR reactions, 38 showed both the control PGR firagment 
and the BAG vector firagment (e.g., G68 in Figs, 5A and 12B), 22 showed only 
the control PGR fragment (e.g., C38 and G52, Figs. 5A and 12B) and 36 

25 showed neither. We performed FISH on clones G38, G52, and G68 along wifli 
the wild type^ES cells using the-entire BAG as probe; For this analysis, 
standard FISH procedures were applied with minor modifications. For staining 
interphase nuclei, about 10^ cells were either directly seeded on a 10-well slide 
(Fisher) or on a regular slide (Fisher) by spinning in a Gytospin centrifuge. 

30 Slides were air-dried briefly and fixed in freshly made 4% paraformadehyde in 



wo 2004/013299 ,^ PCT/US2003/024322 

PBS (pH 7.4) for 10 minutes at room temperature. After rinsing ofif the excess 
fixative in PBS, the slides Were stored in 70% efhanol at 4°C. For staining 
metaphase spread chromosomes, the cells were treated, and the slides were 
prepared according to standard procedures. Either random priming (Prime-It® 
5 Fluor Fluorescence Labeling Kit, Stratagene, La Jolla, CA) or nick-translation 
(BioNick DNA labeling system, Invitrogen/Gibco) worked well- for labeling 
BACs. Under the condition we used, random priming yielded stronger signals. 
Manufacturers' protocols were followed and 35 jxg COT-1 DNA 
(Invitrogen/Gibco) were usually used to suppress nonspecific signals. The 

10 probe was dissolved in Hybrisol VI (Ventana, Tucson, AZ) and then denatured 
at 75^C for 10 minutes followed by incubation at 42°C for 1 hour. Slides 
seeded with 10^ cells per sample were fixed in 4% (wt/vol) paraformadehyde in 
PBS (pH 7.4) for 10 minutes at room temperature. Slides were then dehydrated 
two minutes each in 80%, 85%, 95%, and 100% ethanoL After air-drying, 

15 slides were then denatured in 70% (vol/vol) deionized fomiamide (American 
Bioanalytical, Natick MA), 2x SSC (pH 7.0) at 72''G for 10 minutes and 
dehydrated through ice-cold 70%, 80%, 95%, and 100% ethanoL The probe 
was adjusted to 10 ng/jil, and 4 iil was applied per sample area. Hybridization 
' ~ was carried overnight in a lfirinid"chalnbeTat"42°^ were washed'tHree ^* 

20 times in 2xSSC (pH 7.4) at 72°C for 7 minutes each and once in 0.2x SSC (pH 
7.4) at 72*^0 for 7 minutes. Signals were then amplified using TSA™ 
Fluorescence Systerfis (NEN™ Life Science Products, Boston, MA) following 
the manufacturer's protocol. Slides were finally counterstained with DAPI in ; 
4x SSC, mounted in Vectashield (Vector Laboratories, Inc., Burligame, CA), 

25 and observed under a Zeiss Axioplan 2 fluorescence microscope. 

Using this FISH analysis, we detected two signals in clones C38 and 
C52 as well as in the untargeted ES cells (Fig. 5B) and an extra signal in clone 
C68 (Fig. 5B), suggesting that C38 and C52 were correct targeting products 
and C68 was the result of random integration. 
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Although TSA-FISH has been previously reported to be reliable for the 
detection of less than 10 kb regions, genomic DNA hybridization was also 
perfoimed to exclude the possibility that G418 resistance could be ascribed to 
the randoni integration of a zeo/neo fragment that is too small to be detected by 
5 TSA-FISH (Fig. 12C). The expected patterns were found with all three probes, 
indicating that the zeoV/zeo'' marker had integrated into the fancg/xrcc9 locus 
and that the 20 kb surroimding genomic region had not imdergone any 
disruption other than the desired targeting event (Fig. 12C). As expected, DNA 
hybridization alone could not distinguish orfliotopic (C38 and C52) from 

10 ectopic (C68) integration events (Fig. 12C). We estimate that the region 
encompassed by the blotting data represents 1/5 to 1/7 of the intact BAG. 
Although large DNAs are known to be susceptible to breakage by 
hydrodynamic shear, the protocol used here minimizes the manipulation of the 
BAG DNA in vitro and to date no evidence of adventitious rearrangement of 

15 the targeting site has been observed. Gonsistent with this result, it has been 
previously reported by others that cells modified by gene targeting with one 
DNA fragment rarely simultaneously incorporate a second DNA fragment. 

To confimi that the ES clones identified by FISH represented correct 
targeting events, several ES clones were microinjected into C57BL6 

20 blastocysts, and mice were bred to homozygosity according to standard 

procedures (DePamphilis, supra). All mice strains were raised and maintained 
with standard protocols approved by Massachusetts General Hospital. For 
PGR genotyping, mouse tail genomic DNA was extracted according to 
standard protocols (DePamphilis, supra), and PGR reactions were performed 

25 as described for competitive PGR. We obtained one germline-transmission 
competent chimera for fancg/xrccQ . The resulting mice had homozygous 
mutations, as indicated by genomic PGR genotyping (Figs. 7A and 7B). To ' 
evaluate the locus integrity surrounding the targeted site, genomic markers 
within 100 kb flanking the DNA targeting site were amplified fi-om both wild 

30 type and mutant mice. As shown in Fig. 13G, all the markers tested are 
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preserved in the mutant mice, suggesting that an intact BAG can replace the 
endogenous gene locus by homologous gene recombination without 
adventitious deletion by luidesired recombination between repeated sequence 

elements-inthe-BAC-armSv — - . _ 

5 Phenotype analysis of fancg/xrcc9 deficient mice is described by Yang 

et aL, (Blood 98:3435-3440, 2001). In contrast, we did not obtain any correctly 
targeted clones from. 192 colonies using conventional procedures to target 
fancg/KrccP. 

10 Generation of additional Knock-out Mice — iKfirasl, dok^S/dok-L, Maguin- 
U Cask, CNK, and 2W-5 

BACs containing mutations in iKprasl, dok-3/dok-L, pag/cbp, and tab 2 
were also used to modify mouse ES cells as described above (Figs. 8A-8D, 9A- 
9D, and 13A-13C). We found that the PCR data did not always correlate with 
- -15 — the FISH data -For-example, in targeting the gene we discovered 

that clone CIO did not contain BAG vector sequences as judged by PCR (Fig. 
6A) but had three signals as observed by FISH (Fig. 6B). Tliese results suggest 
that FISH can determine true gene replacement events while competitive PCR 
only serves as a primary screening method to reduce the number of FISH 

20 experiments to be conducted. 

Several ES clones identified by FISH were injected into mice 
blastocysts and bred to produce homozygous mice. We have obtained viable 
mice homozygous for disrupted iKprasl, dok-S/dok-U and pag/cbp genes 
(Figs. 13A and 13B). Breeding of heterozygous TAB2 mutant mice has not 

25 resulted in viable homozygous offspring, but homozygous embryos have been 
detected at Day 9.5 (Figs. 13A and 13B). To date, all ES clones fhathave 
shown germ line transmission have yielded homozygous mice and the 
firequency of obtaining homozygous mice from ES cells targeted by thle 
approach (BAC-FISHing) is 100%. 
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We also targeted two X-chromosoma] genes, cask and maguin-L Since 
there is only one copy of them in ES cells, the wild type locus was disrupted in 
the mutant clones as judged by genomic PGR (see Figs. 15 and 16). 

Cnk and tim-S knockouts may also be generated using the same method. 
In this regard, heterozygous mice from two selected ES clones of each mutant 
genes may be bred to homozygosity. 

The results of several experiments are also summarized in Fig. 14. 
Approxiniately, 1-2x10^ G4T8 resistant colonieslcould generaily b^^ 
from electroporation of 10^ cells. For flie five genes studied, the average 
effective targeting efficiency was 15%, and the estimated absolute targeting 
efficiency was aroutid 10'^ without selection erdchment. Historical gene 
targeting efficiencies vary firom 10"^ to undetectable, depending on the locus < 
targeted, vector design, and selection methods (Bollag et al, Annu. Rev. Genet 
23:199-225, 1989). Some reports suggest that the average efficiency is closer 
to 10'^ to 10"^, which is the mean targeting efficiency of the well-studied hprt 
locus (Capecchi et ai. Science 244:1288-1292, 1989). Our mean ratio of 
homologous to nonhomologous recombination events is 1.5x10 ^ while the 
corresponding ratio for traditional methods is between 10'^ and 10'^ (Reid et 
^z/.,Mol. Cell Bioirn:2769^2777, 1991)". The efficiency aclSeved using the 
methods of the present invention is close to the homologous recombination rate 
found in nuclear extracts fiom murine embryonic fibroblast cells (Thyagarajan 
et aL, Nucleic Acids Res. 24:4084^91 , 1996). 

Generation of Conditiotial Knockouts, Knock-ins, and Chromosomal 
Alterations 

If desired, a cell can be genetically modified such that a deletion 
mutation only occurs in certain cells of a mammal generated using the 
genetically modified donor cell. In particular, the donor cell has two 
recombinase signal sequences (e.g., recombinase signal sequences within 
introns that flank one or more exons to be deleted)"and recombination occurs 
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between the recombinase signal sequences in cells of a predetermined cell type 
of the resulting mammal. For example, the donor cell may be genetically 
modified to encode a recombinase under the control of a promoter specific for 
the predetermined cell type such that the recombinase is only e3q>ressed and 
5 recombination only occms in that cell type of mammal. Altematively, a 
mammal modified to contain the reconibinase signal sequences can be mated 
with a mammal modified to express the recombinase \mder the control of a 
promoter specific for the predetermined cell type. 

We have created a tiap conditional knockout constmct using the 

10 approach described above. First, the FRT flanked zeo/neo marker was inserted 
into intron 1 (Fig. 17A, step 2) and correctly modified Pis were transduced into 
a bactezia host expressing the yeast site-specific recombinase flp to remove the 
zeo/neo marker (Fig. 1 7A, step 3). The FRT flanked zeo/neo marker was next 
inserted into intron 3 (Fig. 17 A, step 4). The FRT sites were therefore 

15 positioned into designated introns by sequential PI modification. The modified 
P 1 was next used to target ES cells and correctly targeted ES cells were 
selected by BAC-FISHing (Fig. 1 7B). All the FRT sites had been faithfully 
integrated into the ES genome (Fig. 17C). Clones A8 and B6 can be injected 
into blastocyst as described above. Following the generation of homozygous 

20 mice, mice may be bred into a mouse strain with a Ick-flp transgene to express 

flp. 

Knock-in mutations can be generated by including a nucleic acid such as 
a reporter gene or nucleic acid encoding a detectable protein between the 
regions of homology in the linear DNA firagment used to generate the modified 
25 B AC. Desirably, the reporter gene is integrated under the control of an 
endogenous promoter of interest. The nucleic acid encoding a detectable 
protein is desirably integrated in-ftame with an endogenous nucleic acid 
encoding a protein of interest for the generation of a fusion nucleic acid the 
encodes a fusion protein with the detectable protein and the protein of interest. 
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For the generation of chromosomal alterations, a genetically modified 
cell is generated with two recombinase signal sequences. If the recombinase 
signal sequences are in the same endogenous chromosome of the cell, 
recombination elimmates lfe^^ recombinase signal sequences.. 

If the recombinase signal sequences are in different endogenous chromosome 
of the cell, recombination results in chromosomal translocation between the 
recombinase signal sequences. Chromosome deletions can also be generated 
by using a linear DNA molecule that has two regions of homology that 
correspond to regions of the endogenous chromosome that flank the DNA to be 
deleted (e.g., a single gene or a cluster of genes to be deleted). The cells and 
mammals generated using this linear DNA molecule have the desired deletion 
mutation. If desired, this method can be repeated to delete an additional region 
beside the first deletion site. 

Optional Modifications to Furthe r Inc rease the Efficiency of the Present 
Methods 

To reduce the time required to generate bacteria containing both a BAG 
and the pBADXredap plasmid, the pBADXredap plasniid can be transferred to 
bacteria containing the BAG by conjugation. In particular, we have discovered 
that plasmids containing either the OriT region fi*om F' or the bom-mob 
complex from the ColEl plasmid can by mobilized by a male E, coli strain 
(e.g., XL-lBlue, an F'-bearing strain) to the BAG host strain DHIOB through 
conjugation. In both cases, tlie small plasmid is preferentially transferred 
relative to the mobilizing F\ To further reduce the contribution of episomal 
elements, the small plasmids can be introduced into an Hfr strain ofE, colL 
We ^ye generated pjasim &e j^B AI)XredaP plasmid or 
that contain bom-mob of ColEl, the pi 5 A replicon from plasmid pACYCl 84, 
and A,red ap or A,red ap plus Pl-Scel under the control of either the Ara BAD 
promoter or the lac uv5 promoter. These plasmids can be introduced into the 
BAG host by growing the two strains together for one hour and selecting for 
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ampicillin and chloramphenicol double resistant colonies, or kanamycin and , 
chloramphenicol resistant colonies, depending on flie resistance of the BAG 
plasmid. 

If desired, a temperature sensitive xeplicon can be used so that cells 
containing only the modified BAG can be isolated by killing cells containing 
the pBADXredap plasmid by shifting the cultures to a higher temperature. In 
particular, the GolEl replicon on the pBADA^redaP plasmid can be replaced 
with the temperature sensitive (ts) replicon fi^^ Cells with this 

pBADXredap plasmid are unable to grow at 42°G, so elevated temperatures 
can be used to separate this plasmid from the modified BAG. 

If desired, the linear DNA molecule used for generation of the modified 
BAG can be generated in vivo by cleavage of a circular fragment containing the 
sequence of the linear DNA fragment flanked by Scel sites. The VDE/PI-Scel 
system can be used according to the manufacturer's instructions for this 
procedure. Thus, a bacterial strain containing a BAG, a bacterial strain 
containing the pB ADT^teda^ plasmid, and a bacterial plasmid with the 
sequence of the linear DNA fragment flanked by Scel sites can be grown 
togetlier, and the BAG and the two plasmids can be transferred into the same 
bacteria by cbnjup:tibh and anti^^^ This method may eliminate 

the need for electroporation of competent cells and allow library screening and 
BAG modification to be performed in a single step. 

Generation of Other Genetically Modified Mammals 

The methods described above for the genetic modification of mice can 
generally be apphed to the genetic modification of any non-human mammal. 
In particular, an artificial chromosome (e.g., a BAG) containing a genomic 
insert from another manunal can be purchased or constructed using standard 
methods (Ausubel et aL, supra). The BAG can be homologously recombined 
with a linear DNA molecule containing a positive selection maiker flanked by 
-two regions with homology to a nucleic acid in the BAC as described above. 
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The resulting modified BAG can be used to alter the genome of cells fiom the 
same genus or species as the source of the genomic insert in the BAG. A 
variety of cells can be modified including embryonic, fetal, and adult 
differentiated or undifferentiated cells. The resulting modified cells can be 
5 used in any standard method for the generation of a cloned or chimeric 

mammal. For example, genetically modified ES cells can be injected into an 
embryo for the generation of chimeric mammals as described above. 
Alternatively, an embryonic or somatic cell or a nucleus fi-om the cell (e.g., an 
ungulate cell) can be inserted into an oocyte (e.g., an enucleated oocyte). After 

1 0 activation of the oocyte, the oocyte is placed in culture medium for an 

appropriate amount of time to allow development of the resulting embryo. At 
the two cell stage or a later stage, the embryo is transferred into a foster 
recipient female for development to term (see, for example, U.S. Patent 
Numbers 4,994,384; 6,077,710; 5,453,366; 5,945,577; 6,258,998; and 

15 5,057,420; Wakayama et al, PNAS 96: 14984-14989, 1999; Wakayama ei al. 
Nature Genetics 24: 1 08-1 09, 2000; and Stice and Keefer, Biology of 
Reproduction 48: 715-719, 1993). 

Exemplary Applications of the Present Methods 

20 The mammals produced by any of the methods of the present invention 

can be used as animal models to identify candidate compounds that modulate 
the expression of a nucleic acid or protein of interest or that are useful for the 
treatment or prevention of a disease. For example, a mammafcan be 
genetically modified to express a reporter gme under the control of a promoter 

25 of interest by the integration of a knock-in cassette containing a reporter gene 
that is flanked by a region homologous to the promoter of interest and another 
region of homology. Gandidate compounds are administered to this mammal > 
to determine whether they modulate the activity of the promoter of interest and 
flius are usefiil for the treatment of a disease associated with a nucleic acid 

30 operably linked to the promoter of interest. Alternatively, a mammal can be 
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genetically modified to e3q>ress a fusion protein that includes a detectable 
protein and a protein of interest. This mammal is generated, e.g., by the 
integration of a knock-in cassette containing two regions homologous to a 
nucleic acid of interest (i.e., tiie nucleic add encoding the protein of interest). 
5 flanking a nucleic acid encoding the detectable protein. Candidate compounds 
are administered to tliis mammal to determine if they modulate the expression 
of this fusion protein in vivo. Such compounds may be useful for the treatment 
of a disease associated with the protein of intereist. 

The present methods can also be used to generate animal models of 

10 various diseases. In particular, one or more mutations associated with a disease 
can be introduced into a mammal (e.g,, a mouse, xmgulate, or primate) as 
described herein. Candidate compounds can be administered to these 
m£immals to determine whether the compounds ameliorate or prevent a 
symptom or other physiological efifect associated with the disease. Exemplary 

15 mutations associated with cancer or diabetes are described by Ruediger et al 
(Oncogene 20:10-15, 2001) and Yokoi etal (Nature Genetics 31:391-394, 
2002), respectively. Chromosomal translocations associated with cancer (e.g., 
acute myelogenous leukemia) are also described by the following references: 
Kuehl et at. Nature Reviews Cancer 2(3): 175-87, 2002; Dyer et al. Leukemia 

20 16(6):973-84, 2002; Gojo et al. Cancer Treatment & Research 1 08:231-55, ' 
2001; Padua et al, Cancer Treatment & Research 108:1 1 1-57, 2001; Falini et 
al, Blood 99(2):409-26, 2002; Stevenson et al, Advances in Cancer Research 
83:81-1 16, 2001; Swerdlow et al, Human Pathology 33(l):7-20, 2002; 
Chakraborty et al. Journal of Cellular Biochemistry 82(2):310-25, 2001; Kelly 

25 et al. Current Opinion in Oncology 14(1): 10-8, 2002; Falini et al, British 
Journal of Haematology 1 14(4):741-60, 2001; Davila et al. Advances in 
Cancer Research 81:61-92, 2001; Sturzl et al. Advances in Cancer Research 
81:125-59, 2001; Arvand etal. Oncogene 20(40): 5747-54, 2001; Barr etal. 
Oncogene 20(40) :5 73 6-46, 2001; Seidel etal. Oncogene 20(40):5718-25, 

30 2001; Aspland et fl/., Oncogene 20(40): 570 8- 17, 2001; Alcalay et al, 
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Oncogene 20(40):5680-94, 2001; Licht et al. Oncogene 20(40):5660-79, 2001; 
Duyster et al. Oncogene 20(40): 5623-37, 2001; Bergsagel et al. Oncogene 
20(40):561 1-22, 2001; Boxer et al. Oncogene 20(40):5595-610, 2001; 
Kuppers et al. Oncogene 20(40):5580-94, 2001; Mullauer et al. Mutation 
5 Research 488(3):21 1-31, 2001; Morris et al, British Journal of Haematology 
113(2):275.95, 2001; Lamorte etal, Surgical Oncology Clinics of North 
America 10(2):271-88, viii, 2001; Manx, International Journal of Hematology 
73(3):308-22i 2001^-^Sataer-e^a/.rIntemational-Joumal-of-Hematology- 
73(3):278-91 , 2001; Holyoake, British Journal of Haematology 1 13(1):11.23, 
10 2001; Bower, British Journal of Haematology 112(4):863-73, 2001; Crans et 
al. Leukemia 15(3):3 13-31, 2001; Lee, British Journal of Haematology 
111(4):993-1009, 2000; Mavrothalassitis etal. Oncogene 19(55):6524-32, 
2000; Garcia-Manero et aL, Hematology - Oncology Clinics of North America 
14(6): 138 1-96, x-xi, 2000; Larson, Hematology - Oncology Clinics of North 
15 America 14(6):1367-79, x, 2000; Thiebaut aL, Hematology - Oncology 

Clinics of North America 14(6): 1353-66, x, 2000; Durrant et aL, Hematology - 
Oncology Clinics of North America 14(6):1327-52, 2000; Faderl et aL, 
Hematology - Oncology Clinics of North America 14(6): 1267-88, 2000; 

.Drexler et aL, Leukemia 14(9„):1533-5§,,2QO0; Pui^^^./3f/. , British. JournaLof 

20 Haematology 109(l):13-23, 2000; and Dierianmi et aL, Hematological 
Oncology 18(1):1-13, 2000. 

The genetically modified mammals of the invention are also useful for 
target validation to confirm that a nucleic acid of interest is associated with a 
disease, disorder, or condition. For example, one or more symptoms associated 
25 with the disease, disorder, or condition can be compared between a mammal 
having a mutation in the nucleic acid of interest or having a mutation that alters 
the expression of a nucleic acid of interest (e.g., a mutation in a promoter or the 
insertion of a exogenous promoter) and a control mammal without the 
mutation. Exemplary nucleic acids include nucleic acids tliat are thought to 
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promote cancer such as possible oncogenes; genes that enhance cell 
proliferation, invasion, or metastasis; genes that inhibit apoptosis; and pro- 
angiogenesis genes. 

Moreover, the present methods can be used as an alternative to 
5 conventional gene therapy methods to correct a mutation associated with a 
disease in cells from a patient (e.g., a human) that are subsequently 
readministered to the patient or administered to another patient (Kyba et al. 
Cell. 109(l):29-37 2002; Hideout et al, Cell. 109(1): 17-27, 2002). In these 
applications, cells (e.g., cells from a human patient such as primary cells, bone 

10 marrow cells, or blood cells) are genetically modified to replace a region of 
nucleic acid that has one or more mutations associated with a disease with a 
nucleic acid segment without the mutation(sj. This genetic modification 
procedure can be repeated, if desired, to correct both alleles of the nucleic acid. 
Then, standard mefliods can be used to administer the modified ceils into the 

1 5 appropriate., area of the patient. 

Other Embodiments 

From the foregoing description, it will be apparent that variations and 
modifications may be made to the invention described herein to adopt it to 
20 various usages and conditions. Such embodiments are also within the scope of 
the following claims. 

All publications mentioned in this specification are herein incorporated 
by reference to the same e5fteiit as if each ihdep"^^ piiblicatibn wpatent 
application was specifically and individually indicated to be incorporated by 
25 reference. 

What is claimed is: 
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